A Coherent Scrutinization on Syntactic Categories for Tagging Tamil Lexicon
نویسنده
چکیده
The arrangement of words based on rules is termed as Syntax. Natural languages have their renowned syntactic rules that demonstrate their latent features. It is attributed in a form of free word order and some have conditions on the word order arrangement. As a consequence, the smallest unit in a sentence called word or lexicon has its unique function which determines the nature of the sentence. The categorized groups of functionalities of the words are termed as syntactic categories. The syntactic categories are also termed as Parts of Speech. Numerous NLP application benefits from this syntactic information, but for morphological rich languages like Tamil, the problem of tagging the every word in a particular part of speech remain a exigent task. This paper reports about the various approaches used for developing POS tagging and the developed POS taggers particularly for the Tamil language is discussed.
منابع مشابه
Feature extraction in opinion mining through Persian reviews
Opinion mining deals with an analysis of user reviews for extracting their opinions, sentiments and demands in a specific area, which can play an important role in making major decisions in such area. In general, opinion mining extracts user reviews at three levels of document, sentence and feature. Opinion mining at the feature level is taken into consideration more than the other two levels d...
متن کاملSyntactic Category Learning as Iterative Prototype-Driven Clustering
We lay out a model for minimally supervised syntactic category acquisition which combines psychologically plausible concepts from standard NLP part-of-speech tagging applications with simple cognitively motivated distributional statistics. The model assumes a small set of seed words (Haghighi and Klein, 2006), an approach with motivation in (Pinker, 1984)’s semantic bootstrapping hypothesis, an...
متن کاملA Linguistic Analysis of Conference Titles in Applied Linguistics
Over the past twenty-five years, researchers have expressed considerable interest in titles of academic publications. Unfortunately, conference paper titles (CPTs) have only recently begun to receive attention. The aim of this study, therefore, is to investigate the text length, syntactic structure, and lexicon of CPTs in Applied Linguistics. A data set of 698 titles was selected from the 2008 ...
متن کاملA Linguistic Analysis of Conference Titles in Applied Linguistics
Over the past twenty-five years, researchers have expressed considerable interest in titles of academic publications. Unfortunately, conference paper titles (CPTs) have only recently begun to receive attention. The aim of this study, therefore, is to investigate the text length, syntactic structure, and lexicon of CPTs in Applied Linguistics. A data set of 698 titles was selected from the 2008 ...
متن کاملOn the Complexity and Typology of Inflectional Morphological Systems
We lay out a computational model for syntactic category acquisition which combines psychologically plausible concepts from minimally supervised part-of-speech tagging applications with simple distributional statistics. The model assumes a small set of seed words (Haghighi & Klein 2006), an approach with motivation in Pinker (1984)'s semantic bootstrapping hypothesis, and iteratively constructs ...
متن کامل